Efficiency of Pregroups and the French Noun Phrase

نویسندگان

Sylvain Degeilh

Anne Preller

چکیده

We study mathematical and algorithmic properties of Lambek’s pregroups and illustrate them by the French noun phrase. An algorithm of complexity 3 n to solve the reduction problem in an arbitrary free pregroup as well as recognition by a pregroup grammar is presented. This algorithm is then specified to run in linear time. A sufficient condition for a language fragment that makes the linear algorithm complete is given. Introduction Pregroups are a recent mathematical tool introduced in [Lambek 99] for natural language processing. They arose as a simplification of the Lambek calculus, first published under the name of Syntactic Calculus in [Lambek 58]. A pregroup grammar consists of a free pregroup and a pregroup dictionary, i.e. a finite relation which associates finitely many elements of the pregroup, the so-called types, to each word. [Buszkowski] has shown that pregroup grammars are weakly equivalent to context-free grammars. Well known algorithms like [CYK] or [Earley] for context-free grammars solve the problem of the grammaticality of strings of words in time proportional to the cube of the length of the string. However, it would not only be clumsy and inefficient to translate a pregroup grammar to a context-free grammar (this grammar would include the whole dictionary in its set of rules), but it would also go against the pregroups spirit. Indeed, pregroups were conceived as a tool which “processes the words as you hear them”. For each word you hear, you choose a type from the (mental) dictionary. The grammaticality of a string of words from the dictionary is then decided by performing computations on the corresponding string of types in the pregroup. So far, much of the work with pregroups has been aimed at establishing the universality of the rules, which are the same whatever the language only the dictionary changes. Fragments of Arabic, Japanese, Latin [MathLing], Polish [Kislak], English [Lambek 99, 03], French [Barg Lamb], German [Lamb-Prel NP, SE], Italian [Cas-Lamb] among others have been analysed with the help of pregroups.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

1 Sylvain

We study mathematical and algorithmic properties of Lambek’s pregroups and illustrate them by analysing the French noun phrase. We establish robustness properties of pregroups and present a simple algorithm of complexity 3 n for deciding reduction in an arbitrary free pregroup as well as a linear algorithm covering a large class of language fragments. In the French noun phrase, the agreement of...

متن کامل

Investigating Embedded Question Reuse in Question Answering

The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...

متن کامل

Do Heavy-NP Shift Phenomenon and Constituent Ordering in English Cause Sentence Processing Difficulty for EFL Learners?

Heavy-NP shift occurs when speakers prefer placing lengthy or “heavy” noun phrase direct objects in the clause-final position within a sentence rather than in the post-verbal position. Two experiments were conducted in this study, and their results suggested that having a long noun phrase affected the ordering of constituents (the noun phrase and prepositional phrase) by advanced Iranian EFL le...

متن کامل

بررسی تأثیر پیچیدگی نحوی ساختار گروه اسمی ‌و فعلی بر وقوع لکنت در کودکان لکنتی پیش‌دبستانی ۶-۴ سال فارسی‌زبان

Objective: The purpose of the present research was to investigate the effect of syntactic complexity of noun phrase and verb phrase on the occurrence of stuttering in 4-6 year Persian speaking children with stuttering. Materials & Methods: This descriptive-analytic research was done on 15 stuttering children, consisting of 12 boys and 3 girls, 4 to 6 years old monolingual Persian speaking wh...

متن کامل

An Algorithm for finding Noun Phrase Correspondences in Bilingual Corpora

94304 Abstract The paper describes an algorithm that employs English and French text taggers to associate noun phrases in an aligned bilingual corpus. The tag-gets provide part-of-speech categories which are used by finite-state recognizers to extract simple noun phrases for both languages. Noun phrases are then mapped to each other using an iterative re-estimation algorithm that bears similari...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Journal of Logic, Language and Information

دوره 14 شماره

صفحات -

تاریخ انتشار 2005

Efficiency of Pregroups and the French Noun Phrase

نویسندگان

چکیده

منابع مشابه

1 Sylvain

Investigating Embedded Question Reuse in Question Answering

Do Heavy-NP Shift Phenomenon and Constituent Ordering in English Cause Sentence Processing Difficulty for EFL Learners?

بررسی تأثیر پیچیدگی نحوی ساختار گروه اسمی ‌و فعلی بر وقوع لکنت در کودکان لکنتی پیش‌دبستانی ۶-۴ سال فارسی‌زبان

An Algorithm for finding Noun Phrase Correspondences in Bilingual Corpora

عنوان ژورنال:

اشتراک گذاری